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The quantum states of two laser pulses — coherent states — are never mutually orthogonal, making 
perfect discrimination impossible. Even so, coherent states can achieve the ultimate quantum limit 
for capacity of a classical channel, the Holevo capacity Attaining this requires the receiver to make 
joint-detection measurements on long codeword blocks, optical implementations of which remain 
unknown. We report the first experimental demonstration of a joint-detection receiver, demodulating 
quaternary pulse-position-modulation (PPM) codewords at a word error rate of up to 40% (2.2 dB) 
below that attained with direct-detection, the largest error-rate improvement over the standard 
quantum limit reported to date. This is accomplished with a conditional nulling receiver, which uses 
optimized-amplitude coherent pulse nulling, single photon detection and quantum feedforward. We 
further show how this translates into coding complexity improvements for practical PPM systems, 
such as in deep-space communication. We anticipate our experiment to motivate future work towards 
building Holevo- capacity- achieving joint-detection receivers. 



One of the most important insights of quantum physics 
in the modern theory of optical communication is the re- 
alization that it is impossible to perfectly discriminate 
states of light that are not mutually orthogonal. While 
orthogonal quantum states of light {IV^/e)}, k = 1,2,..., 
i.e., whose inner products satisfy (ipkl'ipj) = S^j^ can in 
principle be discriminated with zero probability of error, 
such states of light are hard to create [27 . An ordinary 
laser pulse is in a coherent state, \a) (where a is a com- 
plex number denoting the mean field value). However, 
no two coherent states — even those in orthogonal space- 
time field modes — are ever in mutually orthogonal quan- 
tum states, i.e., {a\f3) = exp [a*f3 - \ + 0^ 
precluding perfect discrimination. In spite of this ap- 
parent impairment, coherent states are sufficient to at- 
tain the ultimate (Holevo) capacity of optical communi- 
cation, even on a lossy channel [2], a channel on which 
mutually-orthogonal quantum states such as Fock states 
actually fail to achieve the Holevo capacity. Recently, we 
discovered the first Holevo-capacity achieving codes [12] , 
and some advances on the optimal receiver measurements 
have been reported in recent literature [8] ITTI IT2l . How- 
ever, designing and building such optical receivers remain 
elusive. 

Helstrom gave a recipe to compute the minimum 
achievable average probability of error to discriminate 
quantum states from a given ensemble [1 , known as the 
Helstrom limit. He also found necessary and sufficient 
conditions on the operators describing the optimal mea- 
surement that achieves that minimum. Unfortunately, 
very little is known in general about how to build re- 
ceivers achieving this limit using laboratory optics. The 
binary-hypothesis Dolinar receiver can attain it in the 
case of discriminating two coherent states [5 . This was 
demonstrated only recently, three decades after its inven- 
tion [6], owing to the difficulty of realizing the ultrafast 
electro-optic feedback and high-bandwidth single-photon 



detection it requires. In addition, several sub-optimal re- 
ceiver structures have been proposed which bridge some 
of the gap between the standard quantum limit (SQL) — 
the limit of conventional optical detection strategies for 
the discrimination tasks — and the Helstrom limit. Exam- 
ples include Kennedy's [13 and Takeoka et al.'s p!4[ [T5] 
receivers to discriminate a binary coherent state constel- 
lation and Bondurant's receiver [16] for the quaternary 
phase-shift-keying constellation. In [20 discrimination 
was performed better than the SQL obtainable with per- 
fect quantum efficiency detectors. 

While all these receivers surpass the SQL for opti- 
cal state discrimination, they share the feature of acting 
only on single symbols of the modulation alphabet at a 
time. However, it is known that joint quantum measure- 
ments across multiple symbols are required to achieve 
the Holevo capacity [3 [181 [Hi- The conditional pulse 
nulling (CPN) receiver to discriminate pulse-position- 
modulation (PPM) codewords at a word error rate be- 
low the SQL (direct detection in this case), was the first 
structured optical receiver proposal to employ a joint de- 
tection strategy [10] , the first implementation of which is 
the key accomplishment reported in this article. There 
have since been several discussions of joint measurements 
acting on codeword blocks of symbols that can attain fun- 
damentally higher capacity than the best single symbol 
detector, dubbed "super-additive capacity" |18j[T9]. We 
recently proposed an improved version of the CPN re- 
ceiver [9 , some of the first explicit optical realizations 
of joint-detection receivers |7J [Q, and the first explicit 
Holevo capacity achieving codes [T2j . 

Traditional PPM demodulation uses direct detection 
(DD) of each symbol in the codeword. We denote the 
quantum state for a single-mode coherent state pulse 
with a mean of Np photons by the quantum ket vector 
I a) with a = ^JW^e^^ . Ideal direct detection (photon 
counting) of this pulse generates clicks in a Poisson point 
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process. Using a single photon detector (SPD) results in 
a click occuring or not with probability Pi = 1 — e~^^p 
and Pq = e~^^p respectively, where r] denotes the quan- 
tum efficiency of the detector. For discriminating M-ary 
PPM codewords (M codewords, each being a sequence 
of M coherent state pulses, of the form |0) . . . |a) . . . |0), 
|q/|2 = TVp, of M "pulse positions"), this translates to an 
average error probability Pe^^^ = ((M — 1)/M)e~^^p . 

The CPN receiver improves upon this by combining 
two strategies: (1) manipulation of the quantum (Pois- 
son) noise statistics via coherent pulse nulling; and (2) 
updating the nulling strategy in each time slot based on 
detection events in previous time slots. The first of these 
is essentially the advantage realized in the single-symbol 
Generalized Kennedy (GK) receiver [141 [IS] discrimi- 
nating two coherent states. A "nulling" pulse — hc) 
is mixed with the incoming signal pulse \a) on a highly 
asymmetric beam-splitter of transmissivity z^: ~ 1, result- 
ing in a displacement of the signal \a) to \a — P). The 
key insight of a nulling based receiver is that the "on- 
ofF' detection probability of a coherent state stemming 
from the Poisson noise of direct detection is a function 
of its intensity, and can be adjusted via shifting the en- 
tire modulation constellation. Fig. [TJa) illustrates this 
for the binary on-off-keyed (OOK) modulation alphabet, 
where the Generalized Kennedy receiver chooses a dis- 
placement P to minimize the average probability of er- 
ror in discriminating |0) and \a). As an example, for 
a = 0.2, Pe,DD = 0.480, whereas Pe,GK = 0.415 (at- 
tained for P = —0.61 or 0.71). This is remarkably close 
to the Helstrom limit, Pe,min = 0.401. A similar receiver 
concept has been proposed for the 4-ary phase- shift-keyed 
(4PSK) alphabet fW. 

The second key feature of the CPN receiver is the 
joint detection strategy implemented by utilizing quan- 
tum feed-forward on successive pulse slots based on de- 
tection results from previous time slots. In contrast to 
the Dolinar receiver, which requires instant intra-pulse 
feedback to perform optimally, the CPN receiver only 
requires inter-pulse feed-forward at a speed commensu- 
rate with the pulse repetition rate. Using the previous 
pulse click history allows the receiver to update the poste- 
rior probabilities of the hypotheses and choose a nulling 
strategy that minimizes the final codeword probability 
of error. Specifically, the receiver operates by nulling the 
M pulse slots sequentially per the decision tree shown 
in Fig. [ijb). It starts by applying an exact nulling op- 
tical pulse {P = a) to the first pulse slot and detect- 
ing the nulled slot by an SPD. If a click is not observed 
in that slot, the receiver continues to believe that the 
pulse was in fact present in that first time slot, and pro- 
ceeds to direct detect the remainder of the pulse slots 
without nulling. However, if the first slot did generate 
a click, the first slot is rejected as a hypothesis after 
which the receiver is faced with an M — 1-ary version 




FIG. 1: (a) An on-off- keying coherent state alphabet 
{|0),|a)} is shifted to {\ — /3) , \a — /3)} , by mixing the in- 
put coherent state on a beam-splitter with near-unity trans- 
missivity (hv = 0.99) with a local coherent state l/S/y/l — k). 
The shifted state is direct-detected with a single photon de- 
tector (SPD), resulting in an average probability of error 

Pe,GK = min/3 I l^e"*^"^"^-*^ + (1 — e~^^)j . In the special case 

of "perfect nulling" (/3 = a), the output constellation is 
{| — a), |0)}, which has an effect of swapping the on and off 
probabilities. This (Generalized Kennedy) receiver is a build- 
ing block for the conditional pulse nulling (CPN) receiver to 
demodulate PPM codewords, in which the click records ob- 
tained in previous pulse slots determine the nulling amplitude 
/3 on the current slot, (b) The binary decision circuit for the 
CPN receiver, showed here for the 4-ary PPM case. The 
numbers in yellow represent the receiver's current hypothe- 
ses, whereas the numbers in green denote the final decision (k 
denoting the pulse position where a click was detected). The 
receiver starts out by hypothesizing that code- word 1 was sent 
and nulls the first time slot in order to displace the hypothe- 
sized pulse to vacuum. A no-click event (upper-branch) sup- 
ports the correct hypothesis and the receiver reverts to direct 
detection thereafter. A click (lower branch) indicates an in- 
correct hypothesis and the receiver then nulls time slot two. A 
similar series of binary decisions occurs at each time slot, (c) 
Average probability of error for 4-ary PPM codeword discrim- 
ination versus mean photon number per pulse Np, using direct 
detection (blue), the perfect nulling CPN receiver (red:solid), 
the optimal nulling CPN receiver (red:dashed), and the Hel- 
strom bound (black). Note that the error exponent of the 
Helstrom limit (e"^^^) is twice that of the direct-detection 
receiver (e"'^^). 



of the original discrimination task. It moves to the sec- 
ond slot, nulls it, detects it, and moves on recursively. 
This pulse-to-pulse feedforward is what enables the re- 
ceiver to jointly process the information from all pulse 
slots collectively in a more efficient way than detect- 
ing each slot individually. Fig. [TJc) plots the average 
codeword discrimination error rates for 4-ary PPM code- 
words {|a)|0)|0)|0),|0)|a)|0)|0),|0)|0)|a)|0),|0)|0)|0)|a)} 
using DD and CPN, and compares them to the Helstrom 
limit. At high Np the CPN error attains an error rate 
scaling ~ e~'^^p matching the Helstrom bound while DD 
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FIG. 2: The experimental diagram of our CPN receiver im- 
plementation. EOMl carves 50 ns pulses spaced by 1 /is and 
E0M2 (E0M3) implements the pulse pattern for the signal 
PPM codewords (nulling pulse sequence). The nulling pulse 
decision circuit is implemented via an FPGA. The example 
pulse sequences shown in the two arms represent a signal 
transmitting codeword 2 and the corresponding most prob- 
able nulling pattern for high Np. 

attains ~ e~^p. We recently found two improved ver- 
sions of Dolinar's exact pulse nulling CPN receiver. One 
uses optimized nulling amplitudes in a similar manner to 
the Generalized Kennedy receiver described above. The 
dashed red curve in Fig. [TJc) shows the improved perfor- 
mance over that of the exact nulling CPN receiver, partic- 
ularly at low photon numbers [9]. We implemented both 
the exact and optimized nulling receivers, as described 
below. The other improvement, not implemented here, 
is obtained by applying an optimal amount of squeezing 
(phase-sensitive amplification) on the pulses to further 
manipulate the quantum noise statistics upon detection. 

Our experimental setup is shown in Fig. [2j Frames of 
four 50 ns flat-top pulses in each of the four pulse slots 
are carved from a polarized CW laser beam (A =688 nm) 
by an electro-optic amplitude modulator (EOMl). Pulse 
slots within PPM codewords are spaced by 1 /is. Each 
is split at a 50/50 beamsplitter and sent into the "signal 
arm" and the "nulling arm" where the PPM signal is en- 
coded, and the conditional feedback is prepared, respec- 
tively. Each 4-ary codeword is spaced by 6 /is. The pulse 
configurations for the signal (nulling) arm are prepared 
independently by using E0M2 (E0M3) in each arm as a 
shutter to select which pulses in the four-pulse train are 
transmitted. In the signal arm, one of the four pulses is 
transmitted by E0M2 to encode the desired PPM code- 
word. In the nulling arm, a field programmable gate 
array (FPGA) implements the CPN decision tree shown 
in Fig[TJd) and controls the transmissivity of E0M3. A 
Xilinx Spartan6 FPGA implements the 3-state state ma- 
chine of the decision tree, monitoring the single photon 
detector output, and controlling the optical feedback in 
the nulling arm for the subsequent pulse slot conditioned 



on the presence or absence of a detection event in the 
current pulse slot. The resulting optical fields are com- 
bined on a 99/1 beam-splitter with high transmissivity 
in the signal arm. To maintain fully destructive interfer- 
ence between the two fields at the beam-splitter, a piezo- 
driven mirror is placed in the signal arm to adjust the 
signal arm path length relative to the nulling arm. The 
resulting field is directed towards a silicon avalanche pho- 
todiode single photon detector with detection efficiency 
T] = 0.40, and dark count rate 16 counts/sec. A detec- 
tion event during a pulse slot triggers the decision tree to 
move to its next branch and either provide nulling to the 
next pulse slot or not. A representative example of trans- 
mitting codeword 2 and an associated nulling sequence 
is shown in Fig. [2] 

For each of the codewords, we input a set of 832 frames 
and decoded them using both DD and the exact nulling 
CPN receiver. For each PPM frame, the click record 
over the four slots was used to choose the most likely 
codeword. In certain cases, more than one outcome is 
equally probable (both due to fundamental Poisson noise 
and errors due to dark clicks and imperfect mode match- 
ing of the null and signal), in which case the decoder 
randomly chooses between words that are equally proba- 
ble. We then compared the receiver-generated hypothesis 
with the known input state to calculate the sample-mean 
probability of error in each case. This was done for a se- 
ries of different pulse mean photon numbers Np^ which 
was controlled by adjusting a neutral density filter in 
the signal arm (see Fig. [2| and measured by processing 
the click records of the DD data. The results are plot- 
ted in Fig. [3]^ a) versus the signal pulse photon number 
(normalized to include the measured quantum efficiency 
T] = 0.40). We see that the CPN receiver out-performs 
DD over a large range of Np. The largest fractional im- 
provement of 1.6 dB over DD is observed at Np = 1.25 
photons. At higher Np, the CPN error rate becomes 
worse than that of DD, unlike what is predicted by ideal 
theory. The prediction under ideal conditions is shown 
as the dashed red curve and scales like the Helstrom limit 
at high Np. 

To understand the deviations from this ideal perfor- 
mance we constructed a model which accounts for the 
dark and background counts and possible mode and 
phase mis-match between the nulling and signal pulses. 
The dark and background counts per slot were mea- 
sured from the direct detection data and additional data 
taken with the nulling arm on and signal completely 
blocked. We found that these counts were dominated 
by leakage from imperfect suppressions in the EOMs 
and that a probability of dark and background click, 
^d+bg = 0.0042A/p + 0.01297Vnuii, gave the best fit to 
the data. Here A^nuii = /^^ denotes the mean photon 
number of the nulling pulse. Accounting for this in the 
model leads to a small correction in the error probabil- 
ity for the DD points at higher Np. This was also in- 
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FIG. 3: (a) The probability of error for DD (blue dots) and 
exact nulling CPN (red dots). Vertical error bars are statisti- 
cal errors due to finite number of error events out of the 832 
frames (some data points were taken more than once result- 
ing in smaller error bars) . Horizontal error bars are calculated 
from measured intensity fluctuation levels. The dashed curves 
show the ideal calculated performance of the two receivers, in 
the absence of dark ("d") and background ("bg") counts, and 
any mode ("Am") or phase ("A^") mis-match. The solid blue 
curves show a DD model with Pd+bg = 0.0042Arp+0.0129iVnuii 
while the solid red curve is a CPN model with this Pd+bg and 
with Am + A6» = 0.05. The magenta dots show the experimen- 
tal results obtained by optimizing the nulling photon number 
iVnuii at several different Np values, (b) A nulling photon 
number (A^nuii) sweep holding Np = 0.64 constant. The solid 
curves use the same model as in (a), but the mis-match pa- 
rameter is chosen to be Am -\- Aq = 0.03 to give the best fit 
to the data. 



eluded in our model of the CPN receiver and gave sim- 
ilarly small corrections. A more significant effect arises 
from the fact that the signal and nulling pulses do not 
perfectly interfere. This modifies the Poisson statistics of 
the detection events. Most importantly, when attempt- 
ing to exactly null a pulse, with /3 = a, a pulse with 
mean photon number AmNp impinges on the detector 
rather than vacuum (where is the fractional mode 
mis- match). We believe imperfect polarization from our 
optical elements to be the largest source of such mis- 
match in our current set-up. We estimate temporal mis- 
match to be less than 1 %. A phase difference between 



the signal and nulling pulses in the two arms of the in- 
terferometer leads to an equivalent modification of the 
statistics, where the effective fractional mode mis-match 
parameter is Aq = 2(1 — cos^). We kept A^ + Agi as a 
free parameter and found that choosing this to be 0.05 
gave fairly good agreement with the observed data points 
(plotted as the solid red curve in Fig. [2];a)). We found 
that for any finite mis-match, the CPN receiver perfor- 
mance begins to degrade at some finite Np. We expect — 
with some straight-forward improvements — to be able to 
achieve A^ + Ag* = 0.005, in which case the improvement 
over DD would be 4.6 dB at A/p = 2 photons. 

In the current implementation, no active stabilization 
of the interferometer arms has been implemented, since 
careful shielding and vibration isolation allow the system 
to remain stable (to within 3^) for the order of seconds, 
more than the ~ 0.5 s required to obtain 832 frames for 
each data point. However, the system is designed to ac- 
commodate active phase stabilization, in particular, by 
isolating the signal laser (from a backward-propagating 
servo laser used for active stabilization) for the detector 
using multiple bounces on a pair of wavelength-sensitive 
dichroic mirrors. We note that in a realistic communi- 
cation system the nulling field will not derive from the 
same laser as the signal and must be actively stabilized 
relative to the signal field's phase and mode. 

To further improve the error rates, we next imple- 
mented and explored the optimized nulling version of 
the CPN receiver [9] by holding Np constant and varying 
^nuii- An example with Np = 0.64 is plotted in Fig.|3|b). 
Here we observed an error rate of 0.24 at the optimal 
nulling value of A'nuii = 1.2 photons, which is significantly 
better than the error rate obtained with exact nulling 
(A^nuii = A/p) and a 2.1 dB improvement over DD. This 
highlights the importance of the optimal nulling general- 
ization of the CPN receiver at lower Np and is analogous 
to the improvement the Generalized Kennedy receiver ex- 
periences. We performed nulling photon number sweeps 
at several Np and plot the optimal error probabilities as 
magenta points in Fig. jsja). We observed a 2.2 dB im- 
provement over DD at Np = 1.25 for the optimal value 
A^nuii = 1-4. We note that the data for the nulling photon 
number sweeps matched best with a somewhat smaller 
mode-mismatch parameter, A^ -\- Aq = 0.03 (slightly 
lower than the best match of 0.05 for our exact-nulling 
data set). 

In practice, very low coded bit error rates on commu- 
nunication links employing PPM are obtained by employ- 
ing an outer code — quite often an (n, /c, d) Reed Solomon 
(RS) code — whose code parameters {n = code length, 
2^ = number of codewords, d = minimum Hamming dis- 
tance) are chosen optimally based on the order M of 
the underlying PPM modulation and the mean photon 
number per PPM pulse, Np. In fact, a suitably cho- 
sen RS outer code has been shown to function close to 
the Shannon capacity of the PPM direct detection chan- 
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nel at low photon flux, with apphcations to deep space 
communication \2T . To investigate how our observed 
CPN codeword demodulation error rates can translate 
into improved coded error performance, we performed 
error calculations using a standard formula for minimum 
distance decoding of RS outer codes [22|. Taking our 
measured optimal nulling CPN and DD performance at 
Np = 1.25, we calculated the minimum code length nmin 
required to bring the coded error rate below 10~^^ as a 
function of RS code rate r = k/n, which corresponds to 
the rate of information communicated per pulse. The 
results corresponding to both the ideal theoretical per- 
formance (dashed curves) and our observed error rates 
(solid) are plotted in Fig. [I] One subtlety here is that 
it is optimal for the inner code (PPM) demodulator to 
record measurement results that are equally likely to cor- 
respond to any of the M possible PPM codewords as 'era- 
sures' rather than making a hard codeword decision (and 
most likely making a error) - as the RS outer code can 
more effectively correct erasures than errors. Interest- 
ingly nearly all of the DD receiver's incorrect decisions 
are recorded as erasures (taken to be instances where 
no clicks occur in any slot) and as a result at low rates 
r < 0.5 the DD receiver actually requires slightly smaller 
code lengths than the ideal CPN receiver, which records 
more errors and few erasures. However, it quickly begins 
to perform worse than the CPN at r > 0.5 and as the rate 
approaches the Shannon rate of the DD receiver, its nmin 
quickly grows to > lOx larger that of the CPN receiver. 
There is furthermore a range, between the Shannon rates 
of the two receivers, where the coded error rate cannot 
be made small for the DD receiver, while it can be for the 
CPN receiver. The solid curves show that a good fraction 
of this potential advantage can already be realized with 
the error and erasure rates observed in our experimen- 
tal demonstration. This indicates a big potential savings 
in the coding complexity of a typical deep space laser 
communication system. Alternatively, for a given coding 
complexity, the CPN receiver could achieve the desired 
coded error rate with smaller power or over larger range. 

Recent theoretical results reveal the significance of 
receivers that use sequential decoding and quantum 
feedforward — the hallmark of the CPN receiver — for 
achieving the Holevo capacity of optical communication. 
Recently, Lloyd et al. and Tan showed that a sequential 
decoding approach on a random codebook can achieve 
the ultimate Holevo capacity for any quantum chan- 
nel [TT1[23]. More recently, using Sen's non-commutative 
union bound for sequential-decoding receivers [26] , Wilde 
has shown that a combination of codeword nulling (which 
we have implemented herein), along with a "vacuum or 

not" QND measurement (||0)(0|, /m - |0)(0||) suffice 

to attain the Holevo capacity [24l [25] . The decoder has 
the receiver ask, through a series of dichotomic von Neu- 
mann measurements, whether the output of the channel 
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FIG. 4: Minimum block length rimin of a (n, /c, c?) Reed 
Solomon outer code which attains a coded block error prob- 
ability Pe — 10~^°, as a function of code rate r = k/n. 
We compare the results of outer coding (over errors and era- 
sures generated by the receiver acting on the inner code) for 
a M = 4 PPM inner code with Np — 1.25 mean photon 
number per pulse, both for DD (blue plots), and the CPN re- 
ceiver (red plots) with optimum nulling A^nuii = 1.6 photons. 
The dashed curves correspond to the ideal theoretical per- 
formance, corresponding to the DD receiver's error and era- 
sure probabilities, Pi°°^ = 0, Pem?^ = 0.289, and the CPN 



receiver's error and erasure probabilities, Pi 



,(CPN) 



0.082, 



Pe?^^^ — 0.011. The solid curves correspond to our ex- 



perimentally demonstrated error and erasure rates for the 
two receiver strategies, Pi^^^ = 0.004, PiS?^ = 0.287 and 
p^(CPN) ^ 0.092, Pira?^^ = 0.052 respectively. An erasure for 
the CPN receiver occurs in the event of a click in all four time 
slots (the lower branch in all four steps of the decision tree in 
Fig. mb)), which has zero probability of occurrence for ideal 
exact nulling with no dark or background counts. Fortunately, 
this erasure outcome is the most common incorrect decision 
caused by signal-null mode-mismatch, which is an easier kind 
of error to correct by the outer code (than a hard decision 
error), made by the inner decoder. The vertical lines indicate 
the Shannon rates for each of the four cases plotted, at which 
the rimin required shoots to infinity, and above which rate an 
outer-code-based error correction becomes ineffective (due to 
the converse of Shannon's noisy channel coding theorem |17]V 



was the first codeword, the second codeword, etc. This is 
quite similar in spirit to the CPN receiver, except that it 
requires quantum non-demolition (QND) "yes-no" binary 
projective measurements after each nulling stage, while 
the CPN receiver annihilates the quantum state of each 
pulse slot as it progresses through the codeword. There- 
fore, figuring out how to do the "vacuum or not" mea- 
surement non destructively, and substituting it for the 
SPD in our CPN setup while using an optimal code (in 
place of PPM), would suffice to attain the Holevo limit. 
Furthermore, we recently found the first near-explicit op- 
timal code, one that achieves the Holevo capacity, called 
the quantum polar code^ whose receiver measurement also 
works via a sequence of QND binary projective measure- 
ments, however with an exponential savings on the total 
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number of measurement stages required compared with 
the codeword-nuhing plus vacuum-or-not-measurement 
receiver [12]. 

In summary, we have demonstrated the first optical 
communications joint detection receiver, the CPN re- 
ceiver, which performs a joint measurement over PPM 
codewords, and have shown demodulation probability of 
error improvement over the traditional direct detection 
receiver. We have further demonstrated the optimized 
nulling generalization of this receiver, and with it ob- 
served up to 2.2 dB error rate improvement over DD, 
the largest improvement over SQL reported to date for 
an optical receiver. The pulse photon number regime for 
which we have observed this gain is relevant to deep space 
optical communication links and we have calculated that 
in such a system, our observed improvement can trans- 
late into big savings in coding complexity. Finally, our 
results are an important step in the ongoing quest to de- 
sign and implement joint detection receivers, which can 
eventually attain the ultimate Holevo limit for reliable 
communication of classical data over an optical channel. 
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